Application of the A* Algorithm to Solve the Longest Common Subsequence from Fragments Problem

نویسندگان

  • Yu-Mei Pan
  • Chia-Tung Lee
چکیده

Finding longest common subsequence (LCS) is a common problem in Biology informatics. The problem is defined as follows: Given two strings X=x1x2...xm and Y=y1y2...yn, find a common subsequence L=l1l2...lp of X and Y such that p is maximized. In this paper, we discuss a variation of the LCS problem – LCS from fragments problem defined as follows: Given two strings X and Y and a set M of fragments which are matching substrings of X and Y, find a LCS from M. A new method using a tree searching strategy, A* algorithm, is proposed in this study for the LCS from fragments problem. The method can help us to filter out some fragments which wouldn’t appear in solutions, and efficiently find a solution. However, in worst cases, all fragments are needed to be computed in the solving process.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sparse Dynamic Programming for Longest Common Subsequence from Fragments

Sparse Dynamic Programming has emerged as an essential tool for the design of efficient algorithms for optimization problems coming from such diverse areas as computer science, computational biology, and speech recognition. We provide a new sparse dynamic programming technique that extends the Hunt–Szymanski paradigm for the computation of the longest common subsequence (LCS) and apply it to so...

متن کامل

Construct, Merge, Solve and Adapt: Application to the Repetition-Free Longest Common Subsequence Problem

In this paper we present the application of a recently proposed, general, algorithm for combinatorial optimization to the repetitionfree longest common subsequence problem. The applied algorithm, which is labelled Construct, Merge, Solve & Adapt, generates sub-instances based on merging the solution components found in randomly constructed solutions. These sub-instances are subsequently solved ...

متن کامل

A simple algorithm for the constrained sequence problems

In this paper we address the constrained longest common subsequence problem. Given two sequences X , Y and a constrained sequence P , a sequence Z is a constrained longest common subsequence for X and Y with respect to P if Z is the longest subsequence of X and Y such that P is a subsequence of Z. Recently, Tsai [7] proposed an O(n ·m · r) time algorithm to solve this problem using dynamic prog...

متن کامل

Computing a Longest Common Palindromic Subsequence

The longest common subsequence (LCS) problem is a classic and well-studied problem in computer science. Palindrome is a word which reads the same forward as it does backward. The longest common palindromic subsequence (LCPS) problem is a variant of the classic LCS problem which finds a longest common subsequence between two given strings such that the computed subsequence is also a palindrome. ...

متن کامل

An Effective Branch-and-Bound Algorithm to Solve the k-Longest Common Subsequence Problem

In this paper, we study the Longest Common Subsequence problem of multiple sequences. Because the problem is NPhard, we devise an effective Branch-and-Bound algorithm to solve the problem. Results of extensive computational experiments show our method to be effective not only on randomly generated benchmark instances, but also on real-world protein sequence instances.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005